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ABSTRACT 

Filaments are one of the most prominent features visible in the galaxy distribution. 
Considering the Luminous Red Galaxies (LRGs) in the Sloan Digital Sky Survey 
Data Release Seven (SDSS DR7), we have analyzed the filamentarity in 11 nearly two 
dimensional (2D) sections through a volume limited subsample of this data. The galaxy 
distribution, we find, has excess filamentarity in comparison to a random distribution 
of points. We use a statistical technique "Shuffle" to determine Lmax, the largest 
length-scale at which we have statistically significant filaments. We find that Lmax 
varies in the range 100 — 130 h~^Mpc across the 11 slices, with a mean value Lmax = 
110 ± 12/i^^Mpc. Longer filaments, though possibly present in our data, are not 
statistically significant and are the outcome of chance alignments. 

Key words: methods: numerical - galaxies: statistics - cosmology: theory - cosmology: 
large scale structure of universe 



1 INTRODUCTION 

One of the main goals of redshift surveys is to study 
the galaxy distribution . Variou s redshift surveys (e.g . CfA, 



iGeller fc Huchralll989l: LCRS. IShectman et"aLlll99e ; 2dF - 



GRS. IColles et al.ll200ll and SDSS, IStoughton et al 



I2OO2I ) 



all show that galaxies are distributed in an interconnected 
network of clusters, sheets and filaments encircling nearly 
empty voids. This complex network is often referred to as 
the "Cosmic Web" . Quantifying the cosmic web and under- 
standing it's origin is a challenging problem in cosmology. 

The analysis of filamentary patterns in the galaxy dis- 
tribution has a long history dating b ack to a few papers i n 
the late-seventics and mid-eighties by Joeveer et al. (^1973), 

^inasto ct al. (1980), Zcl'dovich , Einasto & Shand arin . 

' 19821) ■ IShandarin fc ZeldovichI (|l983l ) and lEinasto et'al] 
1984) . Filaments are the most striking vis- 
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Are the filaments statistically significant? This is a ques- 
tion that naturally arises when we embark on studying the 
filaments in the galaxy distribution. The possibility that the 
observed filaments are not genuine feature of the galaxy dis- 
tribution and could arise out of chance alignments requires 
us to establish the statistical significance of the filaments. 

The SDSS (lYork et al.l I2OO0I) is current l y the largest 
galaxy redshift survey. iPandev fc Bharadwail l|2005l ) (here- 
after Paper I) have analyzed the filamentarity in the equa- 
torial strips of this survey. These strips are nearly two di- 
mensional (2D). They have projected the data onto a plane 
and analyzed the resulting 2D galaxy distribution. They find 
evidence for connectivity and filamentarity in excess of that 
of a random point distribution, indicating the existence of 
an interconnected network of filaments. The filaments are 
statistically sign ificant upto a length scales of 80/i~^Mpc 
and not beyond (jPandev fc Bharadwail[2005l ). All the struc- 
tures spanning length-scales greater than this length scale 
are the result of chance alignments. These results are con- 
sistent with the earlier findings from Las Campanas Red- 
shift Survey (LCRS) where the filamentarity was found to 
be statistically significant on scales up to 70 -- 80 Mpc 
in t he —3° slice and 50 — 70 Mpc in the other 5 slices 
(Bh aradwai et al]l2004l ). The average filamentarity of the 
galaxy distribution was shown to depend on various physical 
galaxy properties such as the luminosity, colour, morpholog y 
and star formation rate l|Pandev fc Bharadwail[2006l . l2008l ). 
Recently IPandevI (|201G| ) has studied if the statistically sig- 
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nificant length scale of filaments depends on different galaxy 
properties and finds that it does not depend on the galaxy lu- 
minosity, colour and morphology. Analysis with mock galaxy 
samples from N-body simulations indicates that the length 
scale upto which filaments are statistically significant is also 
nearly independent of bias and weakly depends on the num- 
ber density and size of the galaxy samples. 

The measurement of the length scale upto which the 
filaments are statistically significant could be limited by the 
size of the survey, and it is possible that this has been un- 
derestimated in earlier studies with the LCRS and the SDSS 
MAIN galaxy sample. This issue can be addressed repeating 
the analysis with a bigger galaxy survey. Given a magnitude 
limit, Luminous Red Galaxies (LRGs) can be observed to 
greater distances as compared to normal galaxies. Fur- 
ther their stable colors make them relatively easy to pick out 
from the rest of the galaxies using the SDSS multi-band pho- 
tometry. The LRG sample extends to a much deeper region 
of the Universe as compared to the SDSS MAIN galaxy sam- 
ple. The very large region covered by the SDSS LRG sample 
provides us the ideal opportunity to investigate the size of 
the longest filaments present in the Universe. 

In this work we measure the largest lengthscale at which 
we have statistically significant filaments in the distribution 
of LRGs in the SDSS DR7. We discuss the data and method 
of analysis in Section 2, and the results and conclusions are 
presented in Section 3. 

Throughout our work, we have used the fiat ACDM 
cosmology with Q,m = 0.3, Q,a = 0.7 and h = 1. 

2 DATA AND METHOD OF ANALYSIS 
2.1 SDSS data 

The Sloan Digital Sky Survey (SDSS) l|York et aLlbOOd l is a 
wide-field imaging and spect roscopic survey of the sky using 
a dedicated 2.5 m telescope ijGunn et al.ll2006l l with 3° field 
of view at Apache Point Observatory in southern New Mex- 
ico. The SDSS has imaged the sky in 5 passbands, u, g, r, i 
and z covering 10* square degrees and has so far made 7 data 
releases to the community. The SDSS galaxy sample can be 
roughly divided into two (i) the MAIN galaxy sample and 
(ii) the Luminous Red Galaxy (LRG) sample. For our work, 
we have used the spectroscopic s ample of LRGs derived from 
the Data Release 7 of SDSS-II (|Abazaiian et al.ll2009l ). 

SDSS targets those galaxies for spectroscopy which 
have r magnitude brighter than 17.77 (r < 17.77). To 
select LRGs, additional galaxies are targeted using color- 
magnitude cuts in g, r and i which extends the magni- 
tude limit for LRGs to r < 19.5. The prominent feature 
for an early- type galaxy is the 4000^4° break in its SED. 
For 2 < 0.4 this feature lies in the g passband while it shifts 
to the r band for higher redshifts. Hence selection of LRGs 
involves different selection criteria b elow and above z < 0.4 , 
the details of which are described in lEisenstein et.al.1 mOl\ ) 
(hereafter EOl). 

The criteria for lower redshifts are collectively called 
Cut-I (eq. (4-9) in EOl) while those for the higher red- 
shifts are called Cut-II (eq. (9-13) in EOl), with Cut-I 
accounting for ~ 80% of the targeted LRGs. A galaxy 
that passes either of these cuts is fiagged for spectroscopy 
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Figure 1. The density of LRGs as a function of radial distance r. 
The density has been computed using shells of uniform thickness 
28.75/i~^Mpc in the radial direction. The dashed line shows the 
mean density of our LRG sample. 



by the SDSS pipehne as TARGET_GALAXY_RED while 
a galaxy that passes only Cut II is flagged as TAR- 
GET_GALAXY_REDJI. Hence, while selecting the LRGs 
we require that the TARGET_GALAXY_RED and TAR- 
GET_GALAXY_REDJI flags both be set. 

We obtain the g-band absolute magnitude from the r 
band apparent magnitude accounting for the k-correction 
and passive evolution. We use the prescription for K + e cor- 
rection from the Table 1 in EOl (non-star forming model). 
The cuts deflned above are designed to produce an ap- 
proximately volume limited sample of LRGs upto z « 0.4. 
The comoving number density of LRGs falls sharply beyond 
z ~ 0.4. It is obviously necessary to include LRGs that come 
from the MAIN sample for z < 0.3. But EOl issue a strong 
advisory against selecting LRGs with « < 0.15 for a volume 
limited sample as the luminosity threshold is not preserved 
for low redshifts. We have restricted our LRG sample to the 
redshift range 0.16 < z < 0.38 and g-band absolute mag- 
nitude range —23 < Mg < —21. Figure [T] shows the LRG 
number density as a function of the radial distance r for the 
r range that we have used in our analysis. We see that the 
LRG density has a maximum variati on of ~ 30% around 
the m ean density. An earlier study ((Pandev Sz Bharadwail 
l2006l ) indicates that a density variation of this order is not 
expected to produce a statistically signiflcant effect on the 
average fllamentarity. The earlier study also indicates that 
the length scale upto which the fllaments are statistically 
signiflcant has a very weak dependence on the density. 

Figure [2] shows the geometry of the sky coverage of 
the region that wc have analyzed in survey coordinates A 
and rj (Stoughton et al. 2002 ). For our analysis we have ex- 
tracted 11 strips that are described in the Table 1. Each strip 
spans 100° in A (—50 < A < 50) and 3° in rj {rj ranges are 
shown in Table 1), and radially extends from 462/i~^Mpc to 
1037/i~^Mpc. Adjacent strips are separated by 3°. The phys- 
ical thickness of each strip increases with radial distance. We 
have extracted strips of uniform thickness 24/i~^Mpc which 
corresponds to 3° at a radial distance of 462/i~^Mpc. The 
volume outside these uniform thickness strips was discarded 
from our analysis. 
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Figure 3. The left panel shows the projected 2D galaxy distribution in one of the LRG strips. The right panel shows a shuffled realization 
generated from the same data using L = 90/i~^Mpc. The corresponding 90/i~^Mpc X 90/i~^Mpc grid is overlaid. The blocks defined by 
this grid were randomly interchanged with a rotation. Only blocks that are entirely within the survey area are used in this process. 
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Figure 2. The geometry of the If LRG strips extracted from 
SDSS DR7 plotted in survey coordinates. 



Table 1. Details of the LRG samples extracted from SDSS DR7 



Strip name 






n 


SI 


-33.5 


-30.5 


1211 


82 


-27.5 


-24.5 


1332 


S3 


-21.5 


-18.5 


1477 


S4 


-15.5 


-12.5 


1513 


S5 


-9.50 


-6.50 


1590 


S6 


-3.50 


-0.500 


1568 


S7 


2.50 


5.50 


1677 


S8 


8.50 


11.5 


1601 


S9 


14.5 


17.5 


1422 


810 


20.5 


23.5 


1169 


811 


26.5 


29.5 


1356 



2.2 Method of Analysis 

The strips that we have analyzed extend 575/i~^Mpc in 
the radial direction and ~ 800/i~^Mpc (or more) in the 
transverse direction, while the thickness is only 24/i^^Mpc. 
These strips can be treated as nearly two dimensional 
which makes the analysis relatively simpler. The strips 
were all collapsed along the thickness (the smallest dimen- 
sion) to produce a 2D galaxy distributions (Figure [3]). We 



use the 2D "Shapefinder" statistic iBharadwai et al.ll2000l ) 
to quantify the filamentarity of the patterns in the re- 
sulting gal axy distribution. A d e tailed discussion is pre- 
sented in IPandev Sz Bharadwail (|2006l ). and we present 
only the s alient features h ere. The read er is referred to 
ISahni. Sat hvaprakash.fc Sha ndarinI l|l998h for a discussion 
of Shapefinders in three dimensions (3D). 

The galaxy distribution was embedded in a 2D mesh 
760/i~^Mpc X 1608/!.~^Mpc with cell size 2/!.~^Mpc x 
2/i~^Mpc. The entire galaxy distribution was represented 
as a set of Is and Os on the mesh. Cells containing a 
galaxy were assigned a value 1, and the empty cells were 
assigned a value 0. Note the difference in grid size from the 
IPandev fc Bharadwail l|2005l ') where a 1 Mpc x 1 Mpc 
cell was used. This takes into account the fact that our 
LRG strips have projected galaxy number density ~ 1.9 x 
10 
10 

We use the 'Friends-of-Friend' (FOF) algorithm to iden- 
tify connected regions of filled cells which we refer to as clus- 
ters. The filamentarity of each cluster is quantified using the 
Shapefinder T which defined as 



"^(Mpcft-^)" 
"^(Mpcft-^)- 



^which is considerably smaller than ~ 9 x 
^ in the samples analyzed in Paper I. 



(P^ - 16S) 
{P - 4/)2 



(1) 



where P and 5 are respectively the perimeter and the area 
of the cluster, and I is the grid spacing. The Shapefinder T 
has values and 1 for a square and filament respectively, 
and it assumes intermediate values as a square is deformed 
to a filament. We use the average filamentarity 



Fo = 



(2) 



to asses the overall filamentarity of the clusters in the galaxy 
distribution. 

The distribution of Is corresponding to the galaxies is 
sparse. Only < 1% of the cells contain galaxies and there 
are very few filled cells which are interconnected. As a con- 
sequence FOF fails to identify the large coherent structures 
which correspond to filaments in the galaxy distribution. 
We overcome this by successively coarse-graining the galaxy 
distribution. This is achieved by gradually making the filled 
cells fatter. In each iteration of coarse- graining all the empty 
cells adjacent to a filled cell (i.e. cells at the 4 sides and 4 
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corners of a filled cell) are assigned a value 1 . This causes 
clusters to grow, first because of the growth of individual 
filled cells, and then by the merger of adjacent clusters as 
they overlap. Coherent structures extending across progres- 
sively larger length-scales are identified in consecutive itera- 
tions of coarse-graining. So as not to restrict our analysis to 
an arbitrarily chosen level of coarse-graining, we study the 
average filamentarity after each iteration of coarse-graining. 
The filling factor FF quantifies the fraction of cells that are 
filled and its value increases from ~ 0.01 and approaches 1 
as the coarse-graining proceeds. We study the average fila- 
mentarity F2 as a function of the filling factor FF (Figure 
|4| as a quantitative measure of the filamentarity at different 
levels of coarse-graining. The values of FF corresponding to 
a particular level of coarse-graining shows a slight variation 
from strip to strip. In order to combine and compare the 
results from different strips, for each strip we have interpo- 
lated F2 to 7 values of FF at a uniform spacing of 0.1 over 
the interval 0.05 to 0.65. Coarse-graining beyond FF ~ 0.65 
washes away the filaments and hence we do not include this 
range for our analysis. For comparison we also consider a 
random reference sample generated by randomly distribut- 
ing points over a 2D region with exactly the same geometry 
and number density as the projected LRG samples. We find 
that the filamentarity of the LRG data is in excess of that 
of the random samples (Figure [l]). While this is reassuring 
that we are studying a genuine signal, it should be inter- 
preted with some caution. It is possible that the enhanced 
filamentarity observed in the actual data would also arise if 
the observed two-point clustering were incorporated in the 
random samples even in the absence of higher order clus- 
tering. In this paper we use a statistical technique called 
Shuffle, which does not rely on externally generated ran- 
dom samples, to establish the length-scale upto which the 
observed filamentarity is statis tically significant . Shuf fle was 



first introduce d and applied bvlBhavsax fc Lind(ll988h. Sub 



sequent work (jBharadwai et al 



l2004l : |Pandev fc Bharad"TOil 



l2005l : |Pandevll20ld ) has extended this and applied it to the 
LCRS, the SDSS Main galaxy sample and N-body simula- 
tions. 

A grid with squares blocks of side L is superposed on the 
original data slice (Figure O). The blocks which lie entirely 
within the survey area are then randomly interchanged, with 
rotation, repeatedly to form a new shuffled data. The shuf- 
fling process eliminates coherent features in the original data 
on length-scales larger than L, keeping structures at length- 
scales below L intact. All the structures spanning length- 
scales greater than L that exist in the shuffled slices are the 
result of chance alignments. At a fixed value of L, the aver- 
age filamentarity in the original sample will be larger than in 
the shuffled data only if the actual data has more filaments 
spanning length-scales larger than L, than that expected 
from chance alignments. The largest value of L, Lmax, for 
which the average filamentarity of the shuffled slices is less 
than the average fllamentarity of the actual data gives us 
the largest length-scale at which the fllamentarity is statis- 
tically signiflcant. Filaments spanning length-scales larger 
than I/MAx arise purely from chance alignments. 

For each value of L we have generated 24 different re- 
alization of the shuffled slices. To ensure that the edges of 
the blocks which are shuffled around do not cut the actual 
fllamentary pattern at exactly the same place in all the re- 
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Figure 4. The Average Filamentarity {F2) as a function of the 
Filling Factor (FF). The mean and I — cr errors from 11 strips 
are shown for both the LRG strips and the corresponding random 
realisations. 




Filling Factor (FF) 

Figure 5. The Average Filamentarity (F2) as a function of the 
Filling Factor (FF) for strips SI (left) and S2 (right) of the LRG 
data. The effect of shuffling with L = 10h~^ Mpc is shown for 
both strips. The other value of L corresponds to Lmax which is 
different for the two strips. Notice that at Lmax the actual data 
points lie just outside the 1 — cr error-bars of the shuffled data. 
For the shuffled data we have shown the mean and 1 — cr errors 
from 24 independent realizations of the shuffling process. 



alizations of the shuffled data, we have randomly shifted the 
origin of the grid used to deflne the blocks. The values of 
FF and F2 in the 24 realizations differ from one another and 
from the actual data at the same stage of coarse-graining. 
So as to be able to quantitatively compare the shuffled real- 
izations with the actual data, results from the shuffled data 
were interpolated in the same way as actual data. The mean 
[Shuffled] and the variance (AF2 [Shuflled])^ of the aver- 
age filamentarity was determined for the shuffled data at 
each value of FF using the 24 realizations. The difference 
between the filamentarity of the actual data and its shuffled 
counterparts was quantified using the reduced per degree 
of freedom 



2 1 "p 



(f2 [Actual] - f2[Shuflaed]))^ 
(AF2 [Shuffled])! 



(3) 



where the sum is over different values of the filling factor 
FF. 
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Figure 6. This shows x^/^ ^ ^ function of L for all the 11 LRG 
strips that we have analyzed. 



3 RESULTS AND CONCLUSIONS 

Figure [S] shows the effect of shuffling the LRG data for 2 
of the 11 strips. We see that shuffling with L = 10/i~^Mpc 
causes F2 to drop considerably relative to the unshuffled 
data. It is thus established that the filaments are statisti- 
cally significant at this length-scale. We have varied L in 
steps of 10/i~^Mpc form L = 10/i"^Mpc to 160/i"^Mpc. 
We find that the difference between the actual and the shuf- 
fled data is reduced {ie. F2 increases) as L is increased. The 
filamentarity in the actual data and its shuffled counterparts 
are consistent for L = 160/i^^Mpc indicating that at this 
length-scale the filaments are not statistically significant but 
are the outcome of chance alignments. All the 11 LRG strips 
analyzed here show a similar behaviour. 

We consider x^/'^ fo quantify the difference between 
the filamentarity of the actual data and its shufHed coun- 
terparts. This difference is not considered to be statistically 
significant if x^/*^ ~ 1- For sill the 11 strips, Figure [6] shows 
/i' for different values of L. The value of /'^ f^-Us with 
increasing L. We define Lmax as the largest L value for 
which there is a statistically significant difference caused by 
shuffiing. Increasing L beyond Lmax causes x^ /'^ fo fall such 
that X^ ~ 1- Considering the particular strip shown in the 
left panel of Figure[5l we see that Lmax = 100 h'^Mpc i.e. 
there is a statistically significant difference between the av- 
erage filamentarity of the actual data and its shufHed coun- 
terparts for L < Lmax = 100/i~^Mpc but not beyond. The 
value of Lmax varies in the range 100 — 130 across the 11 
strips that we have analyzed. We have averaged the val- 
ues of Lmax measured in the 11 different strips to find that 
Lmax = 110±12 h~^Mpc. This is the largest length-scale at 
which we have statistically significant filaments in our LRG 
strips. Filaments longer than this, though possibly present 
in the data, are the outcome of chance alignments. 

The value of Lmax estimated using the LRGs is 
somewhat larger than the values in the range 70 — 
80/t~^Mpc obtained i n earlier studies using the LCRS 
( Bharadwai et al."2004ll and the SDSS MAIN galaxy sample 
( Pandev fc B haradwai 200d ). This is possibly a consequence 
of the fact that LRG strips cover a much larger area as com- 



pared to the ones analyzed earlier. For e xamp le, the SDSS 
strips analyzed in lPandev fc Bharadwail (|2005l ) have an ex- 
tent of 336 and 373 h~^Mpc in the radial and transverse di- 
rections respectively, in comparison to 575 and 800ft~^Mpc 
for the strips analyzed here. The larger area ensures a better 
mixing of the blocks in the shufffing process. This is partic- 
ularly important at large L where we had very few blocks in 
the smaller strips that had been analyzed earlier. The larger 
area also ensures that the present estimate of Lmax is less 
likely to be infiuenced by local effects, and hence is more 
representative of the global value. 

Finally we no te that tests with controlled mock sam- 
ples (|Pandevll2010l ) show that the 2D analysis adopted here 
tends to shorten the length of 3D filaments. Further, 3D 
sheets will appear as filaments in 2D. It is thus necessary to 
be cautious in interpreting the consequence of our finding 
in terms of the full 3D galaxy distribution. It is reasonable 
to interpret Lmax = 110 ± 12/i~^Mpc as an order of mag- 
nitude estimate of the largest length-scale at which we have 
statistically significant structures (sheets and filaments) in 
the 3D LRG distribution. 
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